An Implementation of Symbolic Aboutness Theory
نویسنده
چکیده
Today information can be globally shared via the Internet and can be accessible from anywhere in the world. The increasing complexity and size of the WWW urges the need of more effective mode for information processing techniques such as information retrieval and filtering, information summarization, topic segmentation, data mining and information discovery, etc. All of them can be fundamentally considered as informational inference processes. For example, the task of information retrieval is to determine whether a document is about (i.e., relevant to) a user’s information request as completely and as precisely as possible. Therefore, aboutness plays a crucial role in the modern information processing procedures. Most current investigations in the information processing areas, however, are heuristically driven and lack of more formal considerations based on a better understanding of aboutness. We argue that an investigation and formal modelling of aboutness informational inference would be greatly helpful to further improve the performance of information process systems by better making use of the deep semantics involved in these processes. In our previous study [Bruza et al 2000], we proposed a commonsense aboutness theory and a set of properties (in form of symbolic inference rules) of aboutness acceptable from a human reasoning perspective. In this report, we give a brief description of the symbolic aboutness theory, and introduce an implementation of the proposed symbolic aboutness theory – “Penguin” Aboutness Inference System. The experimental results show that the proposed aboutness rules are reasonable, and at the same time, provide us a deeper understanding of the advantages and disadvantages of symbolic approach. Moving down to a vector based conceptual level, which is underneath the symbolic level, would provide a solution to the major disadvantages of symbolic approachescomputational complexity and the frame problem.
منابع مشابه
How the symbol grounding of living organisms can be realized in artificial agents
A system with artificial intelligence usually relies on symbol manipulation, at least partly and implicitly. However, the interpretation of the symbols – what they represent and what they are about – is ultimately left to humans, as designers and users of the system. How symbols can acquire meaning for the system itself, independent of external interpretation, is an unsolved problem. Some groun...
متن کاملA commonsense aboutness theory for information retrieval modeling
Information retrieval (IR) can be viewed as a process to determine the “aboutness”, or sometimes “relevance”, relationship between information carriers (e.g. document and query). Thus, the concept of aboutness lies at the heart of IR. A better understanding of aboutness would lead to more effective IR systems. In this paper, we give a review of the status of current research on aboutness. It is...
متن کاملTowards Functional Benchmarking of Information Retrieval Models
To evaluate the effectiveness of information retrieval (IR) system, empirical methods (performance benchmarking) are widely used. Although they are useful to evaluate the performance of a system, they are unable to assess its underlying functionality. Recently researchers use logical approach to model IR properties so that inductive evaluation of IR could be performed. This approach is known as...
متن کاملAboutness from a commonsense perspective
Information retrieval (IR) is driven by a process which decides whether a document is about a query. Recent attempts spawned from logic-based information retrieval theory have formalized properties characterizing “aboutness”, but no consensus has yet been reached. The proposed properties are largely determined by the underlying framework within which aboutness is defined. In addition, some prop...
متن کاملDeciding Term Aboutness Probabilistically
Information retrieval is the quest to nd those information objects relevant to a given information need. Relevance is a diicult notion to deene operationally. As a consequence information retrieval mechanisms are typically driven by the decision of when one information carrier (e.g. a document) is about another (e.g. a query). As documents and queries are typically complex representations built...
متن کامل